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8, AIMS 


This document proposes an extension, for a period of three 
additional years, of the research carried out with CIAR 
support on the optimal design of laboratory experiments 
involving mutagenicity tests of chemical components. Our 
investigations to date have revealed that the problem of I 

overdispersion of binomial and Poisson data . resurrected by ! 

us after twenty years of dormancy, is intrinsic to the 
consideration of experimental-design optimality involving 
chromosomal aberrations . Moreover, the current literature in 
genetics reveals that this problem is relevant and important 

tQ _thgse_ ggjas i Lci&ts ...wftrMng_ wi t h l arge , . a ggr e g a ted .data 

sets . And our own work has shown that this problem is almost 

certainly susceptible to solution by new and powerful __ 

mathematical and statistical techniques* 

The tractability of these formerly intractable methods is 
due, in great part, to computer technology which has evolved 
in parallel with the collection and aggregation of large 
quantities of biological data. Techniques of analysis of 
such data that are currently in vogue - meta-analysis. 

Poisson-regression, etc. - can no longer rely on the usual 
simplifying assumptions about underlying distributional 
properties of the aggregated data. Instead, new 

distributions that characterize the aggregated data more 
realistically must be considered and applied. To date, we 
have developed a number of such distributions (1,2,3), and we 
are in the process of developing others. We will be applying 
these distributions to large aggregations of data on human 
chromosomal aberrations (10,11,12,13). We will also examine 
the relevancy of our new distributions to recent developments 
jjCLJRoLsson-r egress ion . Our findings will be applicable to 
similar biological endpoints said to result from exposure to 
environmental tobacco smoke . 
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9 , SIGNIFICANCE OF PROPOSED WORK; 


The problem of overdispersion of binomial and Poisson data is 
intrinsic to the consideration of experimental-design 

QPt iRftli-ty involving_bqsan.jehr- p gp gQ ia^l_ aberrations . Recent 

literature on chromosomal aberrations reveals that this 
problem is recognized by geneticists who work with large, 
aggregated data sets(13). They know that statistical 
analyses to determine the significance of differences in 
chromosomal aberrations between groups, or experimental 
treatments, are based on the assumption of an underlying 
Poisson distribution(16,17) of the data. The validity of 
such analyses rests on the condition that aberrations in 
"control cells", (untreated, normal), do indeed follow the 
Poisson distribution* However, because of the low frequency 
with which such aberrations arise, this condition has seldom, 
if ever, been adequately tested* Now, with recently- 
aggregated, large data sets (11.12.13) such checks on the 
distributional _ properties _ of cbroaosowal _ aberrations _ar e 

practical and feasible . _ 

In human cytogenetics, the collection of large numbers of 
cells from many individuals involves the participation of 
many technicians, working in different laboratories, under 
varying conditions, for long periods of time. All of these 
factors tend to contribute additional variation, over and 
above Poisson, to the aggregated numbers upon which 
statistical tests are ultimately performed* If the 

aggregated data are assumed to follow a Poisson distribution, 
the variances are taken to be equal to the means. This fact 
becomes important in weighted regression analysis and in 
hypothesis testing* In weighted regression, the weights are 
generally taken as the inverse of the variance, which, in the 
case of Poisson variables is equal to the mean. If * by 
virtue of overdispersion. the weights are too large* the 
resulting analysis will be distorted. 


In the realm of hypothesis testing, the variance, or a 
function thereof, is featured as the denominator of a ratio, 
whose numerator is a function of the mean* If additional 
variation, introduced by the factors mentioned above, is not 
taken into account, the denominator will be smaller than it 
should be, and the resulting ratio will be larger. This will 
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result in a test statistic that rejects the null hypothesis 
more often than it should* In the parlance of statistics, 
the Type I error will be increased, and the level of 
significance will be decreased* For example, if the level of 
significance is taken as 95*, researchers will conclude that 
&...."giitniff cyei^, (p<,0,05)_ n _.exists.,. none ftiJjfcn.Ah&R 
they should* It also follows that test statistics that are 
designed to test for adverse effects of environmental hazards 
(using dose-response analysis) may overstate the significance 
of effects if overdispersion is present in the data* 


These problems and others are alluded to in recent literature 
(13,14) on observed chromosomal aberrations said to be 

induced by ionizing radiation* In fact, the same problems 
would arise when other clastogens are tested for their 
alleged ability to induce chromosomal aberrations* This 
concern is especially relevant to chromosomal aberrations* 
includingsister chromatid exchanges in humans 

.( IB uJJSuMtZl.% 2 % ) sa id_ to be induced by enyironmental^obacco 

sm oke 
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10-11. PRELIMINARY STUDIES and EXPERIMENTAL PLAN; 


To date, we have submitted for publication two manuscripts 
(1*2), and a third (3) is in. its final stages of preparation* 
Copies of these documents have been forwarded to CIAR as 
reports of progress of our research accomplishments since 
January 1990. 

Our intent now is to establish close working relationshi ps 
with other investigators * whose interests in specific aspects 
of our project will result in collaborations that are of 
mutual benefit. In particular. Dr. Kastenbaum will establish 
a collaborative working relationship with Dr. Michael A 
Bender of the Brpokhaven National Laboratory . Dr. Bender 
possesses and has access to large data sets on human 
chromosomal aberrations said to be induced by exposure to an 
assortment of chemical and radiological agents. Analyses 
have been performed on some subsets of these data, and the 
results have been reported in the literature (11,12,13). 
Much work remains to be done, however, specifically in 
clarifying the various causes of overdispersion associated 
with some chromosomal aberrations and not with others. Such 
clarification will make the new and complex distributions, 
developed by us, more meaningful to the geneticist . Dr. 
Kastenbaum plans to collaborate with Dr. Bender in this 
undertaking. 

Some of the other questions that will be addressed deal with 
the nature of the aggregated data, proper approaches to their 
analysis, and p i j U 9 g.pp h ,ica l jyra cce ptable .f ound atjons lor.Jh.e. 
inferences drawn from such analyses. Such considerations of 
these aggregated data will be immediately relevant and 
applicable to similar data collected in the quest for alieged 
genetic effects of environmental tobacco smoke. 
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At the same time, Dr. Bowman will establish a closer working 
relationship with Dr. E. L. Frome . her colleague at the Oak 
Ridge National Laboratory . Dr. Frome has published a number 
of papers on the subject of Poisson-regression (4,5,6,7,8,9) 
and is an acknowledged expert and leader in this field. The 
relevance of this collaboration to our proposed project is 
readily apparent in some of Dr. Frome*s publications. His 
work encompasses the application of Poisson regression 
techniques to log-1 inear, quasi1inear«and nonlinear models. 
as well as to epidemiologic follow-up data o rganized into a 
life-table t ype format. Overdispersed Poisson distributions 
would affect such analyses by virtue of expected changes in 
the covariance matrix. 


One specific application of Frome's work,(6), is a reanalysis 
of the Doll-Hi11 data on lung cancer deaths among British 
physicians. Of particular interest in Frome's analysis is 

his findine that the treatment of _ data on nonsmokers is of 

critical importance to the results of the analysis. Frome 
demonstrates that estimates of the background death-rate when 
nonsmoker data are excluded from the analysis, is M seven 
times lower than the estimate obtained when data for 
nonsmokers are included." These findings, on the impact of 
nonsmoker data on the results of epidemiological analyses of 
the alleged effects of tobacco smoke, are directly relevant 
to CIAR concerns with the biological effects on nonsmoking 
humans that are said to result from environmentjU^ol^c^o 
smoke . 

We will also continue to collaborate with Dr. L.Ro.v Shenton . 
(U of GA). His interests in practical analyses of 
statistical models, especially those dealing with estimation 
problems associated with distributions, has spanned many 
years, and, in collaboration with Dr. Bowman, has produced 
new results using computer-oriented approaches. 
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12-12A. AVAILABLE FACILITIES AND RESOURCES &._QTHBR_.SUPPORT:. 


This research will be carried out in collaboration with K.O. 

Bo ymm _ of the_Maihemat_i_caI_ Sciences Section-Oak Ridge 

National Laboratory. 


The Oak Ridge National Laboratory is a multi-purpose research 
institution that includes an Environmental Sciences Division 
and a Biology Division* The Mathematical Sciences Section of 
the Engineering and Mathematics Division employs about 35 
research personnel with diversified expertise. 






N 

O 

JO 

C3 

S 

01 


Source: https://www.indust rydocu ments.ucsf.edu/docs/nnlmOOOO 





MARVIN A. KASTENBAUM 


RESEARCH ABSTRACT 

Title of Project: TREATMENT OF OVERDISPERSED, AGGREGATED DATA 
ON HUMAN CHROMOSOMAL ABERRATIONS 

Investigators: K.O. Bowman and Marvin A. Kastenbaum 
Institution: Oak Ridge National Laboratory (Bowman) 

ABSTRACT; 


Our investigations to date have revealed that the problem of 
overdispersion of binomial and Poisson data is intrinsic to 
the consideration of experimental-design optimality involving 
chromosomal aberrations alleged to be induced by chemical 
components in the ambient air. Moreover, the current 
literature on chromosomal aberrations indicates that the 
existence of this problem is recognized by geneticists 
working with large, aggregated data sets# Our research has 
shown that this problem is almost certainly susceptible to 
solution by new and powerful mathematical and statistical 
techniques. Current methods of analysis - meta-analysis, 
Poisson-regression, etc#- rely on untested, simplifying 
assumptions about the underlying distributional properties 
that may not portray the aggregated data accurately. We have 
developed new, more realistic characterizations (distribution 
types) that we propose to apply to large aggregations of data 
on human chromosomal aberrations. Whatever our findings, 
they will be applicable to similar biological endpoints said 
to resultfrom exposure to environmental tobacco smoke . 
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